Tamil IT ! : Interactive Speech Translation in Tamil
نویسنده
چکیده
The Tamil IT! (Interactive Translation) speech translation system is intended to allow unsophisticated users to communicate across the Tamil ↔ English language barrier, without strong domain restrictions, despite the error prone nature of current speech and translation technologies. Achieving this ambitious goal depends in large part on allowing the users to interactively correct recognition and translation errors. We briefly present the Multi Engine Machine Translation (MEMT) architecture, describing how it is well suited for such an application. We then describe our incorporation of interactive error correction throughout the system design. We are currently in the process of developing a Tamil ↔ English system based on this architecture. A Brief Overview of Tamil Language Analysis Like any other language analysis process, Tamil language analysis also involves morphological analysis, syntax analysis and semantic analysis. Tamil is a Morphologically rich language. Most of the grammatical functions are embedded into the word in the form of inflections. Morphological Analysis Here is an example of Tamil morphological analysis. eeRineen eeRu in een (climbed) (Verb) (Past tense) (I Person+Singular+Neuter) In the above example the word eeRineen (climbed) has three morphemes viz.(1) eeRu [Verb for climb], (2) in [Past tense marker] and (3) een [GNP marker]
منابع مشابه
Translating Tamil Speech (SL) as English Text Message (TL) in Android Mobile Phones
Mobiles phones are used every nook and corner and every man, hence innovative technological applications are needed. Moreover, in the scenario of android mobiles not only professionals but even common users expect ample innovations. The paper focuses on translating Tamil speech (SL) as English text message (TL). Even though there are some applications used for translating SL to TL, its one step...
متن کاملThe Transition of Phrase based to Factored based Translation for Tamil language in SMT Systems
Machine translation is one of the major and the most active areas of Natural language processing. Machine translation (MT) is an automatic translation of one natural language into another using computer generated instructions. The utility and power of Statistical Machine Translation (SMT) seems destined to change our technological society in profound and fundamental ways. The current state-of-t...
متن کاملIntegrating Machine Translation and Speech Synthesis Component for English to Dravidian Language Speech to Speech Translation System
This paper provides an interface between the machine translation and speech synthesis system for converting English speech to Tamil text in English to Tamil speech to speech translation system. The speech translation system consists of three modules: automatic speech recognition, machine translation and text to speech synthesis. Many procedures for incorporation of speech recognition and machin...
متن کاملIssues in developing LVCSR System for Dravidian Languages: An exhaustive case study for Tamil
Research in the area of Large Vocabulary Continuous Speech Recognition (LVCSR) for Indian languages has not seen the level of advancement as in English since there is a dearth of large scale speech and language corpora even today. Tamil is one among the four major Dravidian languages spoken in southern India. One of the characteristics of Tamil is that it is morphologically very rich. This qual...
متن کاملAutomatic Conversion of Dialectal Tamil Text to Standard Written Tamil Text using FSTs
We present an efficient method to automatically transform spoken language text to standard written language text for various dialects of Tamil. Our work is novel in that it explicitly addresses the problem and need for processing dialectal and spoken language Tamil. Written language equivalents for dialectal and spoken language forms are obtained using Finite State Transducers (FSTs) where spok...
متن کامل